| PRE-REQUISITE | DSAA 2031 AND UFUG 1601 |
|---|---|
| CROSS CAMPUS COURSE EQUIVALENCE | COMP 4651 |
| DESCRIPTION | Big data systems, including Cloud Computing and parallel data processing frameworks, have emerged as enabling technologies for managing and mining massive amounts of data across hundreds or even thousands of commodity servers in datacenters. This course exposes students to both the theory and hands-on practice of these technologies. The course covers the following topics: (1) basic concepts of Cloud Computing and production Cloud services; (2) MapReduce - the de facto datacenter-scale programming abstraction - and its open-source implementation, Hadoop; (3) Apache Spark - a new-generation parallel processing framework - and its infrastructure, programming model, cluster deployment, tuning and debugging, as well as a number of specialized data processing systems built on top of Spark. Through a number of hands-on labs and assignments, students are expected to gain first-hand experience programming on real-world clusters in production datacenters. |

| Section | Date & Time | Room | Instructor | Quota | Enrol | Avail | Wait | Remarks |
|---|---|---|---|---|---|---|---|---|
| L01 (6410) | Th 09:00AM - 11:50AM | Rm 102, W4 | TANG, Guoming | 40 | 0 | 40 | 0 | |
| LA01 (6411) | Fr 04:30PM - 05:20PM | Rm 150, E1 | TANG, Guoming | 40 | 0 | 40 | 0 | |